Large-scale structure prediction by improved contact predictions and model quality assessment
نویسندگان
چکیده
Motivation Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. Results We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before. Availability and Implementation Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net/ . All programs used here are freely available. Contact [email protected].
منابع مشابه
PconsFold: improved contact predictions improve protein models
MOTIVATION Recently it has been shown that the quality of protein contact prediction from evolutionary information can be improved significantly if direct and indirect information is separated. Given sufficiently large protein families, the contact predictions contain sufficient information to predict the structure of many protein families. However, since the first studies contact prediction me...
متن کاملQAcon: single model quality assessment using protein structural and contact information with machine learning techniques
Motivation Protein model quality assessment (QA) plays a very important role in protein structure prediction. It can be divided into two groups of methods: single model and consensus QA method. The consensus QA methods may fail when there is a large portion of low quality models in the model pool. Results In this paper, we develop a novel single-model quality assessment method QAcon utilizing...
متن کاملEnhanced Predictions of Tides and Surges through Data Assimilation (TECHNICAL NOTE)
The regional waters in Singapore Strait are characterized by complex hydrodynamic phenomena as a result of the combined effect of three large water bodies viz. the South China Sea, the Andaman Sea, and the Java Sea. This leads to anomalies in water levels and generates residual currents. Numerical hydrodynamic models are generally used for predicting water levels in the ocean and seas. But thei...
متن کاملPerformance of the Pro-sp3-TASSER server in CASP8.
The performance of the protein structure prediction server pro-sp3-TASSER in CASP8 is described. Compared to CASP7, the major improvement in prediction is in the quality of input models to TASSER. These improvements are due to the PRO-SP(3) threading method, the improved quality of contact predictions provided by TASSER_2.0, multiple short TASSER simulations for building the full-length model, ...
متن کاملPrediction of fireball consequences caused by Boilover occurrence in the atmospheric storage tanks
Background and Objectives: Although Boilover occurs with a low frequency, but in case of occurrence, it can cause severe damage to people and equipment around the tank. The prediction of the fireball of Boilover phenomenon has an important role to play in adopting appropriate strategies for fire suppression of the atmospheric storage tank. The purpose of this study is to predict the consequence...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 33 شماره
صفحات -
تاریخ انتشار 2017